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Prediction and Retrodiction 



by Satosi Watanabe 

An attempt is made within the framework of the accepted quantum 
physics to achieve the maximum parallelism between prediction 
(inference of the future observational data from the present 
ones) and retrodiction (inference of the past observational data 
from the present ones). To implement this program* it is shown 
that the "retrodictive state function" (extrapolation of the pre- 
sent data to the past) can be just as useful as the ordinary 
"predictive state function" (extrapolation of the present data to 
the future). This leads to a formalism in which time-reversal 
becomes a linear transformation and double time-reversal becomes 
a c-number. In spite of all this formal symmetry* it can be 
shown that the actual success of a retrodiction depends on the 
satisfaction of an additional condition which is not required in 
prediction* and which is not always fulfilled. From the same 
point of view* a logical loophole is pointed out in the indis- 
criminate application of the H- theorem to the past. The so-called 
irreversibility of observation is interpreted in terms of the de- 
crease of "information" in the process of inference. 



#1» Introduction 



In accordance with its expected role in human activities, physical 
theory is pre-eminently a predictive instrument. Man is, however, not 
immune to temptation of the advanture of guessing with the same instru- 
ment what happened in the past outside the reach of his own observation. 

In the non-statistical domain of classical physics, retrodiction must be 
in principle just as successful as prediction. However, in statistical 
applications of classical physics and in quantum physics, a careful study 
is needed to determine the confirmability of an attempted retrodiction. 

The present paper is wished to provide an answer to some of the rudimentary 
questions in this rather neglected field of intellectual interest. Al- 
though some new points of view and a new formalism are introduced, the 
content of this paper will remain perfectly faithful to the accepted pre- 
mises of classical and quantum physics. It should be noted that retrodic- 
tion is a question defined differently from the so-called time-reversal, 
although it is related to this in a certain way which will become clear 
in our Sections 3 and 5 . 

There have been at least three circumstantial incentives which mo- 
tivated undertaking this work. In the first place, it was emphasized by 
the author in a previous paper'*' that an essential difference between clas- 
sical physics and quantum physics lies in the fact that in the latter the 
result of an observation can be used as the initial condition of the 
"state” immediately after the observation but not as the final condition 
of the "state" immediately before the observation. Although this is in 

1. An article contributed by the author to the monograph, Louis de Broglie , 
physicien et penseur (Albin Michel, Paris, 1952) p. 385. 
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agreement with the customary usage of quantum physics* the conscious em- 
phasis on this fact led the author himself to inquire whether one could 

not formulate quantum physics in such a way that the result of an observant 

2 

tion can be used as the "retrodictive state" just before the observation. 
This question will be answered in Section 5 of this paper. Although the an- 
swer is in ihe affirmative, the actual usefulness of such a retrodictive 
theory is extremely limited. 

The second motive stenmed from an enlightening illustration that 
Dr. Keith Symon chose in a conversation to explain the reason why the H- 
theorem cannot be used for the past. He imagines that a man discovers on 
a desk two piles of playing cards, one in a perfect order and another in 
disorder. In spite of the fact that every permutation of cards has the 
same a priori probability, he would not guess that the well-ordered pile 
is a result of shuffling, but he would justifiably infer a selective human 
intervention in the past of the well-ordered pile. Keeping in mind that a 
"permutation of cards" corresponds to a quantum state, "well-ordered-ness" 
and "disordered-ness" to macroscopic cells, and "shuffling" to ergodic 
process, the reader will find that this pattern of inference is given a 
mathematical expression in our formulation of retrod icti on in Section 4. 

3 

Thirdly, everyone familiar with the quantum theory of time-reversal 
is rather disturbed by the fact that the operation of time-reversal is 
not a linear transformation and also by the fact that the operation of 

ft 

2. It is a pleasure of the author to note with thanks that Dr. Adolf Grun- 
baum in a private communication encouraged undertaking clarification of 
this question. 

3. S. Watanabe, Phys.Rev. 84, 1008(1951)« See also, S. Watanabe, Rev. 

Mod. Phys. 27, in press. 
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double time-reversal does not become an identity transformation. One could 
expect that these esthetically unwelcome features of the theory may be 
avoided by a formulation which treats prediction and retrodiction on an 
equal footing. It will be shown in Section 5 that this expectation is 
justified. 

The problem of retrodiction may be formulated in brief as follows; 

An observer B would like to guess from his own experimental data the result 
of another observer A who observed the system some time before B and who 
has not confided his result to B„ The main difficulty for retrodictor B 
arises from the fact that in his retrodictive inference he has to assume, 
apart from his own experimental finding, an a priori probability to each 
possible initial state (in which A might have found the system). There is 
in general no reason to assume an equal a priori probability for each 
quantum state except when the initial ensemble given to A can justifiably 
be assumed to be the result of an ergodic process. It will be explained 
in Section 3 by a simple example how easily a retrodictor can completely 
fail while a predictor cannot fail, mturally in the statistical sense of 
the word. 

However, by assuming the uniform a priori initial probability, one 
can obtain an interesting formalism which exhibits on one hand a complete 
symmetry with respect to the two directions of "time", but which on the 
other manifests a definite one-way-ness of the direction of human "infer- 
ence". In diort, the present paper may be said to be an elaboration in 
the light of quantum physics of the following pregnant words due to W. Gibbs; 

4. J.W. Gibbs, Elementary Principles in Statistical Mechanics (Yale 
University Press, New Haven, 1914) p. 150. 
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"It should not be forgotten* when our ensembles are chosen to illus- 
trate the probabilities of events in the real world* that while the proba- 
bilities of subsequent events may often be determined from the probabili- 
ties of prior events* it is rarely the case that probabilities of prior 
events can be determined from those of subsequent events* for we are rarely 
justified in excluding the consideration of the antecedent probability of 
the prior events," 
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#2. Microscopic Retrodiction 

Let § be a complete set of eigen^statess 






( 2 . 1 ) 



of a family of mutually commuting observables defined with respect to a 
certain physical system* We shall use the same symbol S also to designate 
this family of observables. The completeness of aS> implies that the proba- 
bility p t of the system being found in state -S- t satisfies 



^ P' L • (2.2) 

A family T of mutually commuting observables which do not commute with ^ 
will define another complete set of eigenfunctions. 

The probability that the system which was in state S c at the initial 
instant will be found in state Tj after '£ seconds will be denoted by 

PC^T: = ?(^p , (2.3) 



Sc e S / 



V T 



(2.4) 



where is and T* may or may not be the same complete set. 
assumed completeness , we have 

2 PC L-*j) = 1 . 
d 



On account of the 



(2.5) 



From the invariance of dynamical laws for time-reversal (reversibility) 

or from the invariance for s pac e-and -time-inversion (insersibility) we can 

3 

conclude the inverse normalizations 



2 1 , ( 2 . 6 ) 

L 

This can also be derived from the unitarity of transition matrix. 

Suppose that an observer A observes the system at t = 0 with the ob- 
servable-family •§ $ and that observer B observes the same system at t - 't 
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with the observable-family T . "Prediction" consists in the following posi- 
tion of problem on the part of A. Knowing that observer B will observe with 
T at t -'t , observer A proposes to guess the result of B on the basis of 
his own result. If observer A had the result $- L , then his prediction will 
be that the probability of B obtaining T- will be This means that, 

if observer A prepares a large number U of the cases where the result at 
t—0 was , then NP( ) will be the number of cases where observer B 
will obtain Tj at t = X . Observer A will be called predictor and observer 

Q 

B monitor. 

"Retrodiction" is new to be defined in a close analogy to the pre- 
vious problem, only interchanging the roles of A and B„ Knowing that ob- 
server A observed the system with §> at t -0 , but not knowing what his re- 
sult was, observer B proposes to infer the result of A from his own result 
that the system is found in T: at t = X . B will be called retrodictor and 

o 

A monitor. 

This question does not have a unique answer unless retrodictor B as- 
sumes a certain statistical behavior of monitor A regarding selection of 
the initial states. Independently of his own result at t =r C , retrodictor 
B may have some general information about A, on the basis of which he may 
assume that monitor A has the general habit of selecting (and handing over 
to B) states -S- L with weight uJV (Zu ; = l ). If A prepared a large number tT 
of cases at t>0 , then among them must have been in , according to 
the assumption. At the receiving end,NvJ L P( l~*^) among these will 

turn out to be in T- . The total number of cases which will land in T- 

o 



Z N" w L p ( l -? r ) 

L * 
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will then be 



among whichlTid- P( ) have originated from S>i . Then retrodictor B will 
say that the probability Q ( ■’«- y ) that a system which was found at t = X 
to be in T- had been found in / S- at t-0 is 

_ KuTlPO--^) _ U : Pd^p 
~ z Uvr t ?a^p ' (2o! 

We use the "left-to-right" order to indicate the chronological direction, 

and an arrow to indicate the direction of inference,, 

It should be clearly understood that the above result does not mean 

at all that if A prepared an ensemble with the weight given by (2,8) for 

each £. , then retrodictor B would obtain the result T- „ Indeed, if A 
^ J 

started with the weight distribution (2.8), then B would obtain T k with 
weight ; 

^ W - c B ( i. ) P ( L -» k) 

In other words, (2.8) represents the weight of in- the subset of systems 

ending in T- when the entire ensemble has the weight distribution . In- 
0 

sofar as the estimation of is correct, observer B 5 s retrodition based on 
(2.8) must be statistically successful in this sub-ensemble. If the esti- 
mation of vf is unreliable, all retrodiction is meaningless. 

If retrodictor B does not have any preliminary knowledge about the 
habit of A, the only thing he can do is to resort to the principle of ig- 
norance and to assume that the a priori probability ^ is equal for each 
quentum state S- t . This attitude of B will be successful (verifiable by 
repetition) only if A prepares an ensemble with equal weight for all quan- 
tum states (similar to the raicrocanonical ensemble on an energy shell) and 
if B picks up only those cases which have landed in T- and then classifies 



them according to various possible initial states* According to this simpli- 
fying assumption, (2.8) will become 




(2*9) 



and, further with the help of the inverse normalization (2*6), 



Q 0 U^-p = ? C 



( 2 . 10 ) 



It should be well noted that (2.9) and (2.10) are based on a specific 
assumption that vf- is uniform. In fact, retrodictor B can very easily be 
"fooled" by monitor A. Suppose for instance that there are only two possi- 
ble states (1) and (2) and P( l-> 1 ) = P( 1-^2 ) = P( 2-?i ) = P( 2 -> 2 .) = ^„ No matter 
what ratio ^i/u 2 monitor A may choose, retrodictor B will find one half of 
the cases in state (1) and the other half in state (2). Conversely, ob- 
server B's retrodiction based on the equal distribution that one half of the 
cases must have originated from (l) and the other half from ^2) may be com- 
pletely wrong! monitor A may have handed over to retrodictor B only those 
systems which were found by A to be in (l) at t - o „ The best way to avoid 
this deception on the part of B would be to impose on A, as a rule of the 
game, that he should pick up cases at random from the “microcanical ensemble. 
Then (2.9) or (2.10) will have a meaning in a sub-ensemble which lands in 

Tj . We shall hereinafter refer to the retrodiction based on the uniform 
0 

U*s as a "blind retrodiction." 

Prediction, in contrast to retrodiction, has a simpler rule of games 
the monitor (posterior observer) is required to shew all his results. Then, 
the prediction based on (2.3) will always be statistically successful. It 
should be emphasized that this asymmetry between prediction and retrodiction 
originates from the asymmetry of the "rules of game"? in prediction, the 



predictor has the right to prepare the ensemble , while in retrodiction, 
the monitor has the righto We can easily change the rules to make predic- 
tion just as unreliable as retrodiction 0 Suppose monitor B referring to a 
prediction has the tendency to forget to record some of the cases in such a 
way that the chance of state being recorded by him is proportional to 
^ o Then, the prediction by A will be that the systems registered by him 
as will be recorded by monitor B with the distribution given by 






( 2 . 11 ) 



which offers a nice parallelism to (2.8). We shall however seldom have to 
deal with such a 'forgetful” observer. The "rules of game” must be chosen 
in each case in such a way that they correspond faithfully to the nature of 
the actual description of physical phenomena under consideration. In this 
sense, P( )(2.3) may be used for prediction, but for retrodiction we have 

to use Q.(i*-j. )(2.8) with indeterminate ^ in general case. 

Exception has to be made to the entire consideration of this section 
either if (l) $ and T are the same set and it commutes with the Hamiltonian 
of the system or if (2) S andT are the same and the time duration T is 
zero. In this case, ) - 8tj_ and equation (2.10) follows automatically 

from (2.8) irrespective of the ui”ss 



Q(i*-j) = Kl—p- Zij (2.12) 

Retrodiction is perfectly successful in this special cas@ 0 

The situation in classical physics may be included in this case, if 
the "state” is determined as precisely as possible in principle, i.e„, if 
the system is located at a point in the phase space. 
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#3. Macroscopic Retrodiction 



We shall now introduce the concept of macroscopic cells in our consi- 
deration The macroscopic observations are known to be compatible with one 
another, therefore we can think of a family & of microscopic observables 
which are compatible with all these macroscopic observations,. In general, 
this $ will not be commutable with the exact Hamiltonian of the system, 5 
Suppose further that the eigenstates, S ( , S 2 , etc,, are grouped into macro- 
scopic cells which are labeled by yu*l, 2, , „ in such a way that cell 
contains eigenstates of $ , cell yU-2 contains ft*, eigenstates 
of S , etc, 

A macroscopic prediction consists in inferring the probability of 
finding the system in cell V at t-X when it is known that the system was 
found in cell at t-0 , The answer will be, in terms of the microscopic 
transition probabilities. 



c/*> 



q*.) 

? ck-»v) « Z £ £ (l ^P 
i V L i 



(3.1) 



where 2/ means that L should run over all the eigenstates contained in 

L 

cell , It should be noted that this answer is based on the equal weight 
of /S t within cell ^ , and complete disorder of phase among these states 
In other words, we are taking as the initial state a density matrix (statis- 
tical ensemble) which corresponds to the Hilbert subspace j-L . Writing 
for the projection operator for quantum state , we can express our initial 
ensemble by 









(3.2) 



This is the best we can do under the given information that the system was 



5. The exact Hamiltonian may commute with the '’macroscopic energy” but not 



with the other macroscopic quantities. J„v„ Neumann, ZS, f.Phys, 57,30 
(1929). • "H= 



found in at t=0 . (3.1) is of course normalized with regard to V * i.e.* 

Jjv P (-^ V s ) — i . 

Now the retrodiction consists in inferring the probability that the 
system had been found in macroscopic state ^A at t-O * when it is known 

that the system was found to be in macroscopic state V at • Again 

introducing the a priori probability t^.for each quantum state in cell yu. , 

one will answer that the probability in question is given by 



'uTm.tu'P (u-»v) 

QC/x^-V) - L 



1 ^) CV) 

^ ? ( L **<p 

OO (V) ^ , v 



(3.3) 



If we can assume that the background ensemble of A was a microcanonical 
ensemble* i.e.* if all the vr's are equal* then we can simplify (3.3) to 
the forms* 



q 0 C = CYl y*/ny) PCyU-? 7^) 



(3.4) 



where the inverse normalization (2.4) has been utilized. The probability 
(3.3) or (3.4) satisfies the normalization condition with regard to yx_ : 

2/U Q L^<r- P ) = 0 . 

It might appear as if prediction in the macroscopic ease were equally 
unreliable as retrodiction since we have to use the assumption of equal 
probability (within cell y/. ) also for prediction here. However* it should 
not be forgotten that it is observer-predictor A himself who prepares the 
initial ensemble* therefore unless he puts uneven* selective weights to vari- 
ous states within cell ^a. he can succeed. On the other hand* the supposed 
even weight all over the energy shell assumed in (3.4) is not in the control 
of observer-retrodictor B* who therefore can very easily fail in his retro- 
diction. 

Let us next examine the consequences of reversibility (invariance for 
time-reversal) and inversibility (invariance for space-and-time inversion) 
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on our problem. The reversed state S' of a state 5 means the one in which 
all the particles have the same positions as in S but the equal and opposite 
velocities to those in *5 The inversed state S' of a state <S means the 
one in which all the particles have the same velocities as in S but the 

_ t 

space-inverted positions as compared with & . Reversibility and inversi- 
bility, which hold in the basic processes in quantum mechanics, then means 



PCS — T) = PCXes'; . (3.5) 

The macroscopic observations usually^ cannot distinguish a state from 
its reversed or inversed state. In other words, a cellyU. contains the re- 
versed, as well as inversed, state S ' of a state S if it contains S , Then 
fran (3.1), we obtain in virtue of (3.5)$ 






(3.6) 



Using this relation, we can write (3.3) in the forms 



aX, p CV -*/0 

(AA^V) = r 

1 2^^ p (^/O 



(3.7) 



and with the assumption of uniform weight, 

V ) - PC1P—/A.) (3.8) 

which has a striking simplicity , (3.4) and (3.8) are applicable only to the 

"blind" macroscopic retrodiction. 

What has been developed in this section also applies to classical, sta- 
tistical considerations if we replace the number of quantum states by the 
volume in the phase space. 



6, For the precise definition of time-reversal and space-and-time inversion, 
see Sections 3<j4, Part I, and Section 3# Part II, S,Watanabe, Rev,Mod,Phys, 
27 , in press. 

7. This is certainly the case if we limit the macrosconic quantities to a 

certain category, for instance to the thermodynamical variables. 
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#4. Application of the H-Theorem to the Past 



The probability P( V ) depends naturally on the length of the in- 
terval X between the two observations. The ergodic H-theorem, in essence, 
states that if the exact Hamiltonian does not commute with the observable- 
family § which is compatible with the macroscopic observations, then the 
probability P( -* yiA.) averaged over possible values of is proportional 
to the size of the final cell yU. % 

<p cv-/a)> = (4.u 

where N" is the total number of quantum states on the energy shell. 

Eq. (4.1) shows that if we take a value of X arbitrarily from its 
possible domain: o< X<°° , the probability P( -*^4.) of finding the sys- 
tem in a large cell ^ is large. Invoking now the relation (3.8), we can 
say that on the assumption of blind retrodiction, the probability ) 

that the system had been found at t'-'C in a large cell yu is also large if 
— 7T is arbitrarily taken from its possible domain -o c<.-lz <0 . If we use 
the Boltzmannian entropy -$g : 

(4.2) 

8. Although the non-co mmut ability of the exact Hamiltonian with & is the 
main hypothesis, we need some more auxiliary conditions to derive this 
result. For the two versions of these conditions, see J. von Neumann, 

ZS. f. Phys., 106 , 57 , 30(1929); and W. Pauli and M. Fierz, ZS. f. Phys. 
106, 572 (1937). 

9. The time-average in v. Neumann's proof can be only for the positive 
values of X . It should be noted also that the H-theorem considered 

here refers to one initial observation (t = o ) and one final observa- 
tion (t-'C) and is different from the consideration based on repeated 
observations. See Section 7, Part I, S.Watanabe, Rev. Mod. Phys. 27 in 



-li, 



r 



press 



we can say on the basis of blind retrodiction that if at t=o has a cer- 
tain non-maximum value then it is just as probable to have a larger entropy 
value in the future as in the past# This is the well-known conclusion of a 
formal application of the H-theorem to the past. We could also use the 
Gibbsian entropy ; 



4 ^^)- - ti /. ?(»-»/*> t po -*-/-* 5 /'V 3 



with T>o , but it may be easier to visualize the situation with the help of 
tile Boltzmannian entropy. 

The above argument is based on the premise of blind retrodiction which 
may be the only possible basis of inference if it is perfectly certain that 
the system had been isolated from the exterior system (except a possible 
prior observer who does not perform any kind of selection) and if we have 
absolutely no other information about the system than that it was found in 
V at t-0o However, such conditions are seldom satisfied in the actual 
circumstances . A sounder inference than the mere blind retrodiction, in line 
with Symon » s idea explained in our Section 1, would be somewhat as follows; 

Consider two cells /^andV such that n,? . Then according to (3.6), 
we have 



P C /* v) __ 2}2L « 
p tv 



r 



and according to (4.1) we have also 



(4.5) 



<P C 



« i 
n. 



(x., -X arbitrary ) 



<PC*-/0> 7* 

Now let us assume that we find a system at t=o in cell V „ Seeing from 



(4.6) 



(4,5) that it, is extremely unprobable for a system starting from a large cell 
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1^. to reach a small cell V , we suspect that such was not the actual history 
behind the system we have just found in V . This inference is a direct con- 
tradiction to the result mentioned above based on the uniform UT. Thus we 
are led to modify the assumption of uniform ^ in such a manner as to give 
less weight to larger cells and larger weight to smaller cells. Such 
an assumption of non-uniform ^ is perfectly allowable according to our 
theory. In fact, for instance, if there is any possible doubt about the 
isolation of the system in the past, there is no reason to adopt the hypo- 
thesis of blind retrodiction. Then the result of observation that the sys- 
tem was found in a small cell at present can very well reflect itself in our 
estimation of the id's. 

Once we have abandoned the assumption of uniform \f , we cannot use 
(3.8) any longer and have to go back to (3.7). In spite of the fact that 

may be large, Q. (^A*-V ) can be small if ^ is small in (3.7). And 
the probability of the system having originated from a small cell can become 
quite large. Thus the entropy value at t-'-'tmay probably have been 
smaller. Our formalism is flexible enough to incorporate this very reasona- 
ble inference. 

The above argument can be applied also to the classical, Statistical 
mechanics. It is interesting to note how our argument can stand the famous 
objection due to Loschmidt. It is true that in cell V there are just as 
many microstates headed for larger values of entropy - 83 , in the future as 
those which have originated from larger values of entropy in the past. If 
VT is uniform, then each microstate inside cell ~\) will be occupied by the 
same weight on account of the permanence of the micro canonical ensemble. 
Then, Loschmidt' s argument becomes valid, and we have to conclude larger 



values of entropy just as well for the future as for the past. But if ur 
is not necessarily uniform, then we need not assume equal weight for each 
microstate inside the cell for the purpose of extrapolation towards the 
past. Thai Loschmidt's objection does not hold any longer. For a realis- 
tic macroscopic retrodiction, we should not use the uniform weight within 
the macroscopic cell V , while it may be assumed for prediction. 

It is interesting to note that the blind application of the ergodic 
H-theorem to the past does not actually yield any newer information than 
what one has put in as the assumption. For the combination of (4.1) and 
(3.8) gives 



> - "v/ ^ 



(4.7) 



which is nothing but the expression of a uniform probability, an assump- 
tion which has been used in deriving (3.8). 
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#5. Retrodictive Quantum Mechanics 



Our basic equation (2.8) for retrodiction can be written as 



QU^') = ■uf L Q fl CL*-p/r,- t vj.Q.CL«e- ( |:) y 



(5.1) 



with the help of (2.10): 



Q, ~ PC 



(5.2) 



where the uf- depends on the over-all judgment of the retrodictor. Only 
when the system has been isolated in the past and there is no other clue to 
the past history of the system than the observational fact that the system 
is found at present in state ^ , then the retrodictor will use the uniform 
value of \f- L for various [,' s and Q ( ) will reduce to Q 0 ( I ). w hat 
follows mainly concerns the blind retrodiction represented by Q 0 , but Q 
can be derived from Q. by the use of (5.1) if there is any way of estimating 



In this section, we shall first show that the quantity given in (5.2) 
can be calculated in two ways: either solving the Schrodinger equation with 

the initial condition Bi , or solving the same equation with the final con- 



first method agrees better with the idea suggested by the right hand side 
of (5.2), while the second method reflects more faithfully the idea sug- 
gested by the left hand side. Since the first method is the customary one, 
we shall only show how the second method can be used to evaluate the same 
probability. 

let the eigenfunctions of $ be called > ‘fc, •• and those of T 1 

•vb -vp .... Let further the solution of the Schrodinger equations 








= - l H(t)£(t) 



(5.3) 
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satisfying the final condition: 



(5.4) 

be denoted by ^Ct) . Expanding (f T (p) according to the 'f L : 

V(0)= Z^atfz (5.5) 

we can easily show that Clta^ represents the probability (5.2). 

Consider the transition matrix defined by: 

^UCt^tO/at-, =-f Htb,)U(t„tO , ^VCtM/iU- (5.6) 

U(t,,t, ) = 1 } iriXitU) = ir H Ct <# ti) = uct 1y t t ). (5,7 ‘ ) 

Then according to the customary theory, the probability (5.2) is given by 



?Q->P = I , l/C't/O) qp : '>\ z . (5.8) 

On the other hand, $ T (t) considered above is 

£ r (t)= (TCt^)^ (5.9) 

and the coefficients are 

Q u = VCO,tr) ) (5.10) 

On account of the unitarity of U and of the relation U (o,t ) = U '( T,o), 
(5.7), we obtain 

a,= C f?Co,t j<p t> tj ) 

= ,^)= ttj , v (x, ojf ; f (5 . U ) 

Hence, in view of (5*8), 

a? a, = ? a -j ) = C; *- J ) . ( 5 . 12 ) 



This situation suggests a new picture of the " state” of a system be- 
tween two observations, one at t = 0 and the other at t=Z ° There exist 
simultaneously two states, one being a predictive state ^(t) which com- 
plies with the initial condition at t=0 , and the other a retrodictive state 
■$V(t) which complies with the final condition at t'T » Both idp(t) and 
$ r ( t) obey the same Schrodinger equation. This picture, though redundant 
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in practical applications, offers certain intellectual interest for it pro- 
vides a complete symmetry between two consecutive observations. 

Now, we should like to look upon the same situation from a slightly 
different point of view, namely we attempt to establish a time-symmetry, 
not with regard to two observations, but with regard to the future and past 
referring to a single observation at hand. Suppose we make an observation 
at t=0 and obtain a result ^ . Then our inference will develop towards 
the future just as well as towards the past. Let us introduce a new varia- 
ble S , called inference parameter, which coincides with t when it refers 
to prediction, and which is equal to minus t when it refers to retrodiction. 

$> is then always positive. ’ 

The development of a retrodictive state $V (t ) starting backward from 
c p^ at t-0 is nothing but the extrapolation of the predictive state and 
obeys 

=-LHCO$ r (t) , (5.13) 

or in terms of s , 

As = tUl(-s)$ r (-0 . (5.14) 

The Hamiltonian being hermitian, the complex conjugate of (5.14) becomes 

^ y C-s3/$s = - L^yC-OHC-O (5.15) 

Introducing a time-independent unitary operator R , called reversion opera- 
tor,^* such that 

( R -1 H(-sK = HCO (5.16) 

we can rewrite (5.15) in the forms 

-3 = -iHCO (5>17) 

This means that ^ defined by 

$(s) = R t C-O ^ r (-s)= (5.18) 
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satisfies the same equation as the predictive state: 

di(0/&s = -iH(O §(0 . (5.19) 

The only difference is that satisfies the initial condition: 

i(0) = R r ft , (5.20) 

while the predictive state ^(s) satisfies 

= . ( 5 . 21 ) 

The probability of finding this system at t - s >o in state will be 

Kl-*p = (C+j ,^(0)1* (5.22) 

while the blind probability that the grstem had been found at t>-S<Oin state 
will be 

SoC j v.) = l ( | 

i - 5<s^r. <5-23) 

In brief, the two inferential states "^(s ) and ^.(.5) can be treated in 
a parallel fashion, only using R T( j>* for § ( s ) wherever we would use $ for 
(s). Compare (5.20) and (5.23) respectively with (5.21) and (5.22). It 
would then be a tempting idea to introduce a quantity which comprises both 
5^ and £ on the same footing. A "double inferential state" composed of 
two components: 



will obey the Schrodinger equation: 

^(s)/as = - a HI CO '5' ^ 



with 



H CO = f H ^ 0 



(5.24) 



(5.25) 



(5.26) 



The initial condition of is 



and the solution of ( 5 *25 ) at an arbitrary value of s will then have the 

form! ^( S ) =. ( 5<s) A 

' ) . (5.28) 

We can however liberalize the relationship between the two components of 
(5.28) without affecting their physical meaning. Namely, taking any unitary 
operator^ ^which commutes with all the known physical quantities, we can 
write, instead of (5.28), 

*(S) = ( * (0 ^ 

\ / 



(5.29) 



This amounts to replacing the retrodictive state $ r (-s) by W , 

which of course does not change the content of a retrodiction. The initial 
condition of (5.29) is then 



^•(o) = ( * l 

\ R 1 w* f* 

To make our discussion more concrete let us take as W 



W - A 



n. 



where KVin any arbitrary integer and A is given bjA° 



*8t 



.N; 



(5.30) 



(5.31) 



(5.32) 



A =&* - A 1 = A* = R r -ff' =• TTu C" 0 

in which N\, is the occupation number operator for the spinor eigenstate 
labeled L . 4 is known to commute with the reversion operator, R . Then 

general pattern of a double inferential function is 

, C s >o) , 



W R £ ^(-s) 



(5.33) 



with arbitrary . A K is unity when IT. is even. 

We can now introduce the "reversed" inferential function of (5.33) 



by 



= ( 



A n R r $*(-s) 

£>(»> 



Cs> o) , 



(5.34) 



10. See Section 12, Part II, S. Watanabe, Rev.Mod.Phys. 27, in press. 
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which certainly falls in the supposed general pattern of an Inferential 
function (5.33), only the arbitrary number Yi being replaced by TH-J * for 

5(0= a" +i R r (4’V$*(-0)* (5.35) 

Furthermore, at each value of $ , the first arri the second components of 
(5.34) represent respectively the so-called "reversed states" of the first 
and the second components of (5.33).^ Indeed, for a given state ^(t - ), its 
reversed state can be expressed by W R~ ( - t). The transformation from 
(5.33) to (5.34) can be written 

« r ( 0=R1ECO y -H-fcR&O , (5. 36 ) 



with 



R - 






(5.37) 



The formalism presented here has no practical advantage over the current 
quentum theory, but it has a formal advantage in that time-reversal is repre- 
sented here by a linear transformation (5.37) and double time- reversal^ be- 
comes an identity transformat ions 

R z — 1 . (5.38) 
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# 6 . Irreversibility of Inference and Information 

Suppose observer A prepares a large number K of systems which were found 
at t-0 in state ^ „ After 'C seconds, each system will become <p. . 

If observer B performs at t=' C an observation with the complete set T ^ ..J , 
then K ^ (j (x,o) <jj> L ) |~ systems will turn out to be in state ^ . ^ith 
the help of projection operators we can write this process in the fol- 

lowing schemas 

= % - 'VCv {*,0) fij -*■ 

= ? left, . (6-1) 

The amount of "information"^ carried by the knowledge about the system 
represented by is 

I = Sp*. (<g. ) + owiT. ( 6 . 2 ) 

This quantity does not change in the first step of transition in (6.1), but 
does decrease in the second step. This is the famous irreversibility of ob- 
servation pointed out by von Neumann.^" It should be noted that ^3 in 
(6.1) does not represent the knowledge obtained by observer B in individual 
cases, for in each case observer B knows perfectly well in which one of the 

11. C. Shannon and ¥. Weaver, Mathematical Theory of Communication (University 
of Illinois Press, Urbana, 1949). We do not indulge here in the discus- 
sion regarding the sign before the Spur and regarding the constant in 
( 6 . 2 ). The quantity ( 6 . 2 ) was first used by von Neumann, Mathematische 
Srandlagen der Quant enmechanik (Julius Springer, Berlin, 1932). See also 
L. Szilard, ZS. f. Phys. 53, 840(1929). For an early application of the 
quantity (6.2) to a concrete physical problem, see S. Watanabe, ZS. f. 
Phys. 113, 482(1939). 
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Y 1 s the system is found. cam be considered as a global description of 
the entire ensemble after the observation, or as the prediction of the re- 
sult of B in each case. 

Next, suppose that observer A prepares at t=0 a large number of sys- 
tmes with equal weight in all possible ^ 1 s ( £ & ). Observer B performs 
at an observation with T 3 , and a certain large number IT of systems 

is found to be in state Yi . He considers now only those systems ending 
in , and ask what percent of them had been registered as by bhe P re “ 
vious observer A. Then, he extrapolates '■h backward by the Schrodinger 
equation from t-'t to t= o , and calculates I C cp , \J Co, t) ) I*" • His 

inference will then be that, among N systems that he found in vj/. f 

Q 

IT I , lT(o, T ) YjL )*" systems must have been found by A to be in 

Yc • Schematically, this inference can be denoted ty 



= Silc<f L ,irco,'t)'|'pl i ‘?£ < i , d «- 
= ‘PtVCO/'O+i] *- g/ = £ ■fp , 

which exhibits a parallelism to (6.1). in (6.3) represents a partial 

ensemble immersed in the uniform ensembles 



g. = ‘xSc'P CfO (M> 

prepared by A. If A would have started with Cjf , then B would not obtain 
. Nonetheless, represents the legitimate inference made by B 

based on the blind retrodiction hypothesis with regard to the results that 
A had obtained in the systems which were later found by B in *p: . 

If B has any further source of judgment about the initial ensemble, 
he will modify the assumption of uniform weight, and attach a reappraised a 
priori probability to each this case, will become 

= (St U?t, V(o ( T) fp l i, ?C<fc3} / /{2 l vr-J(Y-,U(o / T)Y i )| 2 j ( 6e5) 
in accordance with (2.8) or (5.1) 
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It is evident that, no matter whether one uses of (6.3) or that 

of (6.5), the amount of information carried by is smaller than that car- 
ried by C j/ , i.e., the decrease of information here takes place in the back- 
ward direction of time. Both the case of prediction (6.1) and the case of 
retrodiction (6.3) or (6.5) can, however, be included in the statement that 
the amount of information decreases in the direction of inference, i.e., in 
the positive direction of the inference parameter of the last section. This 
last result is in a good agreement with the common sense, for an inference 
cannot contain more information than the fact from which the inference is 
drawn. 

In the statements in the foregoing, the phrase "information decreases" 
must be replaced by "information remains constant" in the following two cases s 
When (l) and Tare the same set and the elapse of time ^ is zero, or 
(2) § and T are the same set and commute with the Hamiltonian of the system. 



— 26 — 



1 



llord 



axflorc 

ASE BINDER 

Syracuse, N. Y. 
Stockton, Calif. 




TA7 27446 

•U6 ifetanabe 

no. 6 Symmetry of physical 
laws, Part 3. 




TA7 27446 



.u6 Vvatanabe 

no.b Symmetry of ohysicel 
lavs. Part 3- 




